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Structural Comparison of the Genes of Two HLA-DR Supertypic 
Groups: The Loci Encoding DRw52 and DRw53 Are Not Truly Allelic 

Jack Gorski*, Pierre Rollini, and Bernard Mach 

pApiftmem of Mierobioloay, University ofGenwn Mwdical School, 1211 Genevq, Switzerland 



' Abstract. The organization and sequence of the HLA-DR^ 
chain genes are compared in the two supertypic groups. 
DRw52 and DRw53 . which together account for more than 
80 % of HLA-DR alleles. From the structural data, we con- 
! elude that these two groups represent distinct lineages 
; which have followed different patterns of evolution. The 
1 fine structure of the fl chain locus encoding the DRw53 
specificity corresponds most closely to the DR e II pseu- 
i dogene in the DRw52 haplotypes. Concomitantly, the 
DRfll locus in DRw53 haplotypes is more closely related to 
both of the two expressed DR 0 loci of the DRw52 haplo- 
types (DRqI and DR$ III). These two loci are the result of 
5 arecent duplication. Thisleads to the proposal that both ex- 
pressed DRfj chain genes in the DRw52 haplotypes (DRpI 
and DR e IJI) are derived from a single precursor locus, 
while the two loci expressed in the DRw53 haplotypes are 
| derived from distinct ancestral loci. The genes encoding 
< DRw52 and DRw53 are therefore not true alleles of the 
} same original locus. A scheme is proposed that accounts 
1 for the evolution of DR specificities within the DRw52 and 
= DRw53 groups ofhaplotypes.Ttis evidentthatihedifferent 
HLA-DR alleles are not structurally equidistant and that 
; one must take into consideration different degrees of heter- 
\ ozygosity or mismatch among the DR alleles. 



] Introduction 

t A striking feature of the major histocompatibility complex 
j (MHC) is its remarkable allelic polymorphism . This poly- 
morphism is responsible for the phenomenon of restriction 

: °f antigen presentation to T lymphocytes of the same 
; Optotype. The class II products of the MHC are involved 

to presentation of antigen to a subset of T lymphocytes. 

Different class II allelic products can present certain anti- 
I gens with either low or high efficiency, thus determining 

| * Present address: The Blood Ccmer of Southeastern Wisconsin, 
ft Milwaukee. Wisconsin 



the extent of the immune response (reviewed by Benaeerraf 
1981, Nagy et al. 1981 p Schwartz 1985). The biological 
significance of MHC class II polymorphism is the ability it 
confers to the population to cope with a variety of patho- 
gens and thus to survive as a species. 

The major class n product of the human MHC (HLA) 
is HLA-DR, Tbi9 transmembrane protein is composed of 
two chains, the a and 0 chains. The ot chain is encoded by 
a single locus (4) and shows no polymorphism. The 0 
chains , on the contrary P are highly polymorphic and are en- 
coded by several Cfl) loci (reviewed by Kaufman et al, 
1984, Rollini et al. 1985), The correlation between the 
products of each of these individual 0 chain loci and the ob- 
served phenotypic traits such as serologic reactivity, al- 
loreactive T-cell stimulation and restricted antigen presen- 
tation represents an important challenge. 

HLA-DR was first defined by serology. Sera directed 
against HLA-D/DR molecules often recognize the 
products of several different alleles and are thus designated 
as supertypic sera. They define groups of cross-reactive 
haplotypes called 1 'supertypic groups . ' ' Two major super- 
typic groups have been identified. The members of the 
DRw52 supertypic group, haplotypes DR3, DR5, DRw6, 
and DRw8, share the DRw52 specificity. The members of 
theDRw53 group, DR4,DR7 t tmdDRw9> share a specific- 
ity referred to as DRw53 (Bodmer and Bodmer 1984). 
Together these haplotypes account for more than 80% of 
known DR specificities. The epitopes recognized by super- 
typic sera could be encoded by a single DR & chain locus or 
by conserved regions shared by several £ chain loci. 

The identification of multiple HLA-DR 3 chain loci 
(Long et al. 1983) and the possibility of nonequal rates of 
divergence of these loci led us to propose that a supertypic 
epitope could be encoded by a less polymorphic locus 
shared among several haplotypes (Gorski et al. 1985). 
Analysis of one of the HLA-DR^ chain genes expressed in 
DNA-transfected mouse fibroblasts has shown directly 
that it enc des the DRw52 specificity (Gorski et al. 1985). 
This gene has now been identified as the DR Q III locus of a 
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DRw52 haplotype (Rollini et al . 1985) . In a DRw53 haplo- 
type, amino acid sequence analysis of the DRw53 reactive 
product and the nucleotide sequence of genes cloned from 
a DR4 haplotype have permitted the identification of one of 
the DRq chain loci as encoding DRw53 (Sorrentino et al. 
1985. Spies et al. 1985). It is generally assumed that 
DRw52 and DRw53 are encoded by alleles of the same lo- 
cus (Bach 1985). 

In this study, we have analyzed the structural relation- 
ship of the jS chain genes of the DRw52 and DRw53 super- 
typic groups at the level of fine restriction maps and of 
DNA sequence. We can conclude that the two expressed 
HLA-DR $ chain loci in the DRw52 haplotypes (DR$I and 
DRpHI) are homologous to the DR^I locus of the DRw53 
haplotypes, and that these three genes are presumably de- 
rived from a common ancestral locus, DR 8 L In contrast, 
the gene encoding the DRw53 specificity is distinct from 
these three genes and is most related to the ZM^/pseu- 
dogene of the DRw52 haplotypes. These two genes are 
presumably derived from another ancestral locus . The data 
suggest an evolutionary scheme for the HLA-DR$ chain 
region, where the genes encoding DRw52 and DRw53 are 
not alleles of the s ame ancestral locus . They also have prac- 
tical implications concerning the nature and the magnitude 
of the structural differences among different HLA-DR al- 
leles. 



Materials and Methods 

Isolation and nomenclature ofDR g chain genes. The isolation and mop- 
ping of the DR^ chain genomic clones for ihe DR3 andDRw6 haplotypes 
have been described (Rollini « al. 19S5J. cDNA clones Horn a DR4, W6 
cell line have also been published (Long el uL 1983). The DR4/DRw53 
genomic clone wu isolated from z phage library from the same DRA. u/6 
cell line. Four overlapping phage clones were i&oluied. The organization 
of ihe three DR3 B chain genes has been described (Rollini et al. 1985). 
These loci are numbered in the direction of transcription using Roman 
numerals. The terms DRw52 supenypic group and DRw53 superlypic 
group wo unedto dbfinc the cvolurionnrily related groups of haplotypes 
sharing th* DRwS2 and DRu/53 specificities* respectively. The terms 
DRwS2 locus and DRwS3 locus refer to the transcriptionally active locus 
which carries xhc DRw52 or DRw53 supenypic apecificky and not the DR 
specificity. In DRw52, where the Ihree DR$ loci have been linked, they 
are logically identified according to the gene order {DR$I, DR^Ix 
DRpffJ)- In the case oEDR4, however, nil 0 chain loci hflvp not yet been 

linked (Spies et el. 1985). and since four DR 0 chain loei have been identi- 
fied in certain DRw53 haplory pes (Btihmc et al. 1 98 5) , ihe locus encoding 
DRw53 might in face correspond 10 a DRpIV \ocvs (see Discussion). 

Oligonucleotide blot hybridization. The oligonucleotides, 19mor3, wore 
kindly provided by Dr. BL SCAusjuhixniL CBiogen S .A.) or synthesized using 

a Pharmacia Gene Assembler. The sequences which correspond to die 
different probes used are shown in Figure l. The experimental protocol 
for the u 3D of oligonucleotides under conditions of discrimination of single 
nucleotides ts published (Angelini et al, 1936;). 

Mapping of genomic clones. The restriction enzyme maps presented here 
correspond to a fine level structural analysis of the genes. Mapping ^vas 
performed by the classic single and double digestion techniques followed 



by Southern blot analysis to localize the exons by hybridization with 
cDNA probes. Position of the exons is bused on the maps of subclones 
used Tor sequencing che respective cxnns. 

Sequence analysis. The DR3. DRw6a. and DRw6b first-domain Sc _ 
quences used for the sequence comparison are published elsewhere (Gor. 
ski und Mach 1986) . The first domain sequences of the DRw53 gene from 
q DR4 haploiype was determined from the genomic PR4 clone and from 
'its corresponding cDNA (cDNA DR^lV in Long « al. 1983): Tt\t se- 
quence far the DIM fl l chain is from cDNA. clone DR^U. in Long ec al. 
1983 (B- Grubenmann et al. „ unpublished data). The some sequences have 
been recently published from other DR4 cell lines (Spies et al, 1955 
Gregersenetol. 1986a). 



Results 

Identification of the DRwSB locus by oligonucleotide 
Southern blot hybridization. The DRw53 locus has been 
identified by comparison of partial amino acid sequence of 
a protein precipitated with anti-DRw53 sera with the DNA 
sequence of a gene from a DR4 haplotype (Sorrentino et al. 
1985, Spies etal. 1985). We have determined the DNA se- 
quence of the two expressed DR4$ chain genes cloned 
from a DR4/w6 cell line and on the basis of these se- 
quences, prepared oligonucleotide probes [4/1 and 4/2) 
which allow the unambiguous identification of one or the 
other locus (Fig. 1). The use of such 19 bp long 
oligonucleotides as probes to identify specific DR loci or 
specific DR alleles has been described in detail (Angelini ei 
al. 1986). Probe 4/1 has been previously shown to hybri- 
dize only to DNA from DR4 individuals and thus identifies 
the corresponding sequence as that of the DR4 baploiypic 
locus (Angelini et al. 1986). in contrast, probe 4-2 hybri- 
dizes to DNA from ecus of the DR4 and ZWZhaplorypes, 
both members of the DRw53 supenypic group. Another 
pair oflocus-specific probes (4/3 ana 4/4) corresponding 
to another region of the first domain (see Fig, l) showed 
identical results (data not shown). Therefore, the 0 chain 
gene from which the sequences of probes 4/2 and 4/4 were 
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Fig, 2. Comparison of the maps of the 
three DRw6 loci with the map of the 
DR4/DRwS3 locus. Sites for the follow- 
ing restriction eazym&S are shown: B. 
Bum HI; H. Hind IE; Q, Xho I; R, Eco 
RI; 5, Sac I; X, Xba. Region* of high ho- 
mology arc boxed. The black boxes refer 
lo the caoru encoding, respectively; 1, 
firet domain: 2, second domain; 3, 3' UT 
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derived corresponds to the locus encoding the DRw53 su- 
pertypic specificity. 

7fo gene encoding the DRw53 specificity is more related to 
the DRpIl pseudagene than to any other DRq gtnes of (he 
DRw52 haplotypcs. In the DRw52 haplotypcs wc have es- 
tablished a linkage map of three DR$ chain loci referred to 
as DRpI t DRpIl, and DR $ IU (Rollini et al. 1985) . The DRpI 
and DRq/II loci arc expressed whereas the DR^II locus 
lacks the first-domain exon and is thus considered a pseu- 
dogene (Rollini et al. 1985, 1987). Detailed restriction 
maps were generated for the three DRw6$ chain loci. 

A detailed map of the DRwS3 locus identified above 
was also prepared. It corresponds well to the map of the 
gene identified as DRw53 on the basis of partial amino aeid 
sequence (Sorrentino et al. 1985). This DRw53 gene map 
was compared with maps of the three DR 0 chain loci from 
the DRv»6 haplotype, DR^l, DR^U, and DR^IU (Fig. 2). 
As can be seen from the comparison, the best homology be- 
tween the DRw53 locus and any of the DRw6 loci is be- 
tween DRw53 and DRw6 p IL The homology between the 
two restriction maps extends well downstream of the exons 
encoding the protein itself and covers a region known to 
contain repetitive sequences. Similarity of noncoding 
regions of DNA, which are not likely to be under selective 
pressure, is indicative of an ancestral relationship. 



?j The DR4pl locus is more related to the rwo expressed 

|;i DRw52 loci ihan ir is to the locus encoding DRwS3. The 

■'■ relationship between the genes of the DRw52 and DRw53 

Lj supertypic families w as farther analy 2ed by comparis on of 



nucleotide sequences. It had been shown earlier that DR 
polymorphism is mostly associated with the first domain of 
the polypeptide chain (Kaufman and Strominger 1982). 
The sequence of the first-domain exons of the DR4qI and 
Z>R4-DRyv53 loci are shown in Figure 1 . These were com- 
pared with the- first-domain exon sequences of the DRfJ 
and DRplII (DRw52) loci of ti\zDR3, DRw6a t and DRwGb 
haplotypes, all members of the DRw52 supertypic group. 
Table 1 shows the number of amino acid differences, with 
the numbers in parentheses corresponding to nucleotide 
differences . It is clear that the Z?£4p/locus is equally relat- 
ed to both the DR^I and DR p I2I loci of haplotypes within 
the DRw52 supertypic group and that it is more related to 
these loci than it is to the DR4-DRw53 locua. 

The distribution of allelic differences at locus DR^Iand at 
the supertypic locus DRpIlL Comparison of DNA se- 
quences from the first domain of DR p chain genes also 
permits an analysis of the distribution of allelic differ- 
ences. This analysis can shed light on which regions of the 
DRq chain may be involved in contact with antigen and/or 
the T-cell receptor. It also indicates; where polymorphic 
sites may influence recognition by serologic reagents. 
When all DR gene sequences are compared in a nonallelic 
manner and irrespective of supertypic boundaries, a pat- 
tern is seen (Figure 3 A) where several major regions of se- 
quence variability can be identified (bp 15-25; bp 65-100; 
bp 155-165; and bp 195-220). However, comparisons 
made at one given locus and within a supertypic family 
show a very different distribution of polymorphic sites - 
The simplest case is the series of DR4 alleles at the DR$I 
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Fig, 3 A-D. Ploiof distribution of polymorphic nucleotide sites in thefirw- 
dornaia encoding region, A Comparison of a number of ffiA-DRp chain 
genes irrespective of locus or supenypic group. The sequences used are 
from those presented here and those cited below as well as others previous- 
ly published (Tbnnelle ei al. 1995, Qustafsson ei al. 1984). B Comparison 
Of alleles at the DR$t locus of a number of DR4 cell lines (Cairns et al. 
1985, Grcgeraen et al. 19B6ti). C Comparison oTdlleles at the DRpf locus 
of a number u£ DRw52 haplatypca (Garaki and. Mach Tichcr ct si. 

1986). D Comparison of alleles at the DR $ fI/ locus of a number of DRw52 
haplotypes (Gorsld and Mach 1966. Didier et al. 1986). The height of the 
line indicates the frequency of the second most common nucleotide. An i'n- 
temtpted line indicates chat mare than two nucleotides arc found at that 
position and me length of each pan of the line corresponds to me frequency 

of one of the nucleotides 



locus (Fig. 3B). These alleles are all recognized ait DIM 
by sexology and are only distinguished by alloreactive 
T-cell reagents defining Da/ specificities . As has been 
pointed out previouidy (Cairns etal. 1085, Gregersenetal. 
1986a), these differences are found predominantly in the 
185-210 bp region. When the alleles at the DRpl locus of 
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different haplotypes of the DRw52 supertypic family are 
analyzed (Fig. 3C), the 185-2 10 bp region is also the most 
polymorphic segment, although some polymorphism is 
evident in the 155-165 bp region. In contrast and $urpri s . 
ingly, the alleles at the DR^III locus of the DRw52 super- 
typic group show a very different distribution pattern from 
those observed at the DR^l locus of either the DRw52 o r 
DRw53 supertypic group (Fig. 3D), with almost no vari a . 
rion in the 185-210 bp hypervariable segment. 

The l$-25 bp region is only polymorphic when the 
comparison includes different loci and different supertypic 
groups (Fig. 3A). It is not polymorphic when alleles within 
a supertypic family are compared (Fig. 3B S C, and D) . This 
indicates that the 15-25 bp region plays a role in distin- 
guishing loci and supenypic groups rather than in restrict 
ed T-cell recognition. 



Discussion 

The similarity of DR gene restriction maps in different 
haplotypes within an HLA-DR supertypic group and the 
concomitant differences across supertypic groups indicate 
that within such a supertypic group the different haplotypes 
are evolutionarily related. We have compared the structure 
of the differen t DR p chain genes within and across super- 
rypic groups in order to determine the relationship of the 
various loci to each other. Our first observation is that the 
gene encoding the DRw53 specificity has characteristic 
structural features which correspond more closely to the 
i3J2pZ7pseudogene in the DRm>52 haplotypes than to any of 
the active DR Q chain loci, including the DR p I locus of the 

DRw53 haplotypes. Comparison of first-domain se- 
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Fig. 4, Schematic representation of [he 
evolutionary tree of the PR g chain gene 
femily, A two-locus family is postulated 
as the mammalian ancestral arrange- 
ment. The DRw52 lineage (left) hft= un- 
dergone n deletion of moat of the ances- 
tral DRplI locus. The DRpf locus and 
promoter region of the DR^II locus were 
duplicated. These two events could have 
happened in opposite order or simultane- 
ously. In the DRwS 31 100050 the ancestrpJ 
Dfyt-DRpfl pair u«l« duplicated. 
Whether locus DR p fJJ has been deleted 
Or maintained will only be known when 
the linkage maps are available- Po'r 
in orphic variations were introduced ut 
The PRpT loci al a higher raw than st (he 
other loci, thus establishing haplorypic 
(DRgl) and supertypic (DRw52 and 
DRw53) loci 
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Srruccure of Laci Encoding HLA-DRw52 and w53 

quences of the active loci shows the corollary: the first lo- 
cus of DRw53 haplotype (DR4 0 J) is more closely related to 
either of the two active DRwSl loci (DR p l and DR^III) 
than it is to the DRw53 locus in its own haplotype. 

These observations indicate that the DRw53 and 
DRw52 loci are not alleles in the classical sense. Based on 
these structural data, a limited evolutionary scheme is pro- 
posed here to explain th e origin oiHLA-DR polymorphism 
in these two supenypic groups (Fig. 4). We propose two 
ancestral DR & chain genes, DR p I (pte-DRw52) in white in 
Figure 4, and DRpII (pre-DRw53) in black in Figure 4. In 
the DRw52 lineage (left), a deletion involved part of the 
DRgll locus and a duplication event took place involving 
the DRpI gene (in white in Fig. 4) and the remnant 
promoter region of the DR Q 11 locus. This resulted in the 
two highly homologous DR$1 and DR p IU genes and in the 
truncated DR^Jl pseudogene found in the DRw52 lineage 
(RoUini et al. 1985). We postulate that the current DRw53 
organization arose by an early gene duplication of the pair 
of ancestral DR 0 chain loci resulting in four loci (Fig. 4, 
right). The ancestral DR^II locus became a pseudogene 
early, and has undergone extensive mutation (Larhammar 
et al. 1985). The fate of the third locus in the DRw53 lin- 
eage is not yet known as the complete linkage map has not 
been completed (Spies et al. 1985). Since four DR 0 chain 
genes have been identified in certain DR&53 haplotypes 
(Bfihme et al. 1985), the DRw53 locus (black in Fig. 4) 
should probably be referred to as DRfJV. In both the 
DRw52 and w53 lineages, the different loci were submit- 
ted to variable mutation rates depending on their position 
on the chromosome, with the locus adjacent to the DR a lo- 
cus undergoing the least mutation. At this point, various al- 
leles at locus Z>iy (Fig. 4, bottom line) arose by different 
mechanisms , including gene conversion (Gorski and Mach 
1986). 

This evolutionary scheme accounts best for the struc- 
tural data presented here. It implies that the DRw53 locus 
and the DR $ mocus of the DRw52 family share a common 
ancestor and thus can be considered alleles (black boxes). 
The DR4pI locus and the DR ? I and DR 0 IU loci of the 
DRw52 family share a common ancestor (white boxes). 
More importantly, the origin of the supertypic locus in 
each lineage, DRw52 and DRw53 , is clearly different and 
thus the DRw52 and DRw53 loci cannot be considered al- 
leles in the classical sense. 

In the DRw53 lineage, the DRw53 locus is identical in 
DR4 and DR7 haplotypes which differ in their DR^J locus 
and in their Dw specificities (Caims ei al. 1 985, Gregersen 
et al. 1986a, b) . In the DRw52 lineage, three alleles of the 
less polymorphic locus, DRw52 or DR^llI, have recently 
been identified (Gorski and Mach 1986, and unpublished 
data). These alleles split the DRw52 group of haploiypes 
into the 52a, b, and c subgroups. Segregation studies using 
oligonucleotide probes specific for the DRw52a and 
DRw52b alleles further indicate that the haplotypic DR and 
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Dw specificities are not encoded by the DR^II locus but 
rather by the DR $ 1 locus (Gorski ct al. 1 987). 

The DNA sequence comparisons have allowed us to 
map the positions of allelic differences. When true allelic 
comparisons are made, within a supertypic group, the 
polymorphism of the DRpI locus is characterized by a 
"hypcrvariablc region" corresponding to bp 185-210 
(amino acids 67-77). It is of interest that this segment has 
been implicated as the site of micro-rccambination/gene 
conversion events , which directly affect the specificity and 
restriction of T-ccll recognition (Gorski and Mach 1986). 
Interestingly, allelic comparisons of the DR^ffi locus in 
theDRw52 supertypic group show a very different pattern 
of polymorphic sites, with little or no differences in the 
"hypcrvariablc region" at bp 185-210. This suggests that 
the mechanisms responsible for the allelic polymorphism 
at locus DRgI and locus DR P III are not the same. It also 
raises the question of the nature of T-ccll recogni don of the 
products of each of these loci and of whether these Two DR 
products play similar or different roles in the immune 
response, in particular in restricted antigen presentation to 
T cells. 

The differences in the structure of the two supenypic 
groups have major consequences for serological analysis. 
The DRw53 locus docs not have a corresponding allele in 
the DRw52 family. Therefore, heterozygosity within and 
between supertypic groups docs not imply the same degree 
of structural differences. For example, a DR3, DRw6a in- 
dividual is really DR3, DRw6a, DRw52a whereas a 
DR3.DR4 individual is DR3, DR4, DRw52a, DRw53. 
This difference in relative heterozygosity may prove to 
have important implications in the ability to respond to a 
wider selection of antigens and in the association of HLA 
classllwith disease susceptibility. More importantly, it in- 
dicates that in HLA matching for organ transplantation, all 
HLA-DR mismatches are not of identical magnitude and 
consequence, especially when the supertypic loci (DRw52 
and DRm>53) are taken into account. As shown in the exam- 
ple above, a single DR mismatch within a supertypic group 
can imply only a minor difference limited to the DR^I 
gene product, whereas any mismatch across supertypic 
groups involves the additional differences in the DR anti- 
gen encoded in the DRwS2 and DRw53 supertypic loci. 
We have therefore proposed (Mach et al, 1986) that in ad- 
dition to the number of mismatches, the nature of the mis- 
match might have important practical consequences in 
transplantation and should be taken into consideration. 
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